首页> 外文OA文献 >Efficient multicore-aware parallelization strategies for iterative stencil computations

【2h】

Efficient multicore-aware parallelization strategies for iterative stencil computations

机译：迭代的高效多核感知并行化策略模板计算

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Stencil computations consume a major part of runtime in many scientificsimulation codes. As prototypes for this class of algorithms we consider theiterative Jacobi and Gauss-Seidel smoothers and aim at highly efficientparallel implementations for cache-based multicore architectures. Temporalcache blocking is a known advanced optimization technique, which can reduce thepressure on the memory bus significantly. We apply and refine this optimizationfor a recently presented temporal blocking strategy designed to explicitlyutilize multicore characteristics. Especially for the case of Gauss-Seidelsmoothers we show that simultaneous multi-threading (SMT) can yield substantialperformance improvements for our optimized algorithm.

机译：模板计算在许多科学仿真代码中消耗了运行时的主要部分。作为此类算法的原型，我们考虑迭代式Jacobi和Gauss-Seidel平滑器，并针对基于缓存的多核体系结构的高效并行实现。 Temporalcache阻塞是一种已知的高级优化技术，可以显着降低内存总线上的压力。我们为最近提出的时间阻塞策略应用并优化了此优化，该策略旨在显式利用多核特性。特别是对于高斯-塞德尔斯moothers的情况，我们证明了同时多线程（SMT）可以为我们优化的算法带来实质性的性能提升。

著录项

作者
Treibig, Jan; Wellein, Gerhard; Hager, Georg;
展开▼
作者单位

展开▼
年度 2010
总页数
原文格式 PDF
正文语种 {"code":"en","name":"English","id":9}
中图分类

相似文献

外文文献
中文文献
专利

1. Efficient multicore-aware parallelization strategies for iterative stencil computations [J] . Jan Treibig, Gerhard Wellein, Georg Hager Journal of computational science . 2011,第2期

机译：高效的多核感知并行化模板迭代计算策略
2. Strategy for data-flow synchronizations in stencil parallel computations on multi-/manycore systems [J] . Szustak Lukasz Journal of supercomputing . 2018,第4期

机译：多/多芯系统上模板并行计算中数据流同步的策略
3. Hierarchical parallelization and optimization of high-order stencil computations on multicore clusters [J] . Hikmet Dursun, Manaschai Kunaseth, Ken-ichi Nomura, Journal of supercomputing . 2012,第2期

机译：多核集群上的高阶模版计算的分层并行化和优化
4. Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory [C] . Wittmann M., Hager G., Wellein G. 2010 IEEE International Symposium on Parallel Distributed Processing, Workshops and Phd Forum . 2010

机译：模板代码的多核感知并行时间阻塞，用于共享和分布式内存
5. Towards Automatic Compilation for Energy Efficient Iterative Stencil Computations [D] . Zou, Yun. 2016

机译：朝向节能迭代模板计算的自动编译
6. Handling Big Data in Medical Imaging: Iterative Reconstruction with Large-Scale Automated Parallel Computation [O] . Jae H. Lee, Yushu Yao, Uttam Shrestha, -1

机译：在医学成像中处理大数据：大规模自动并行计算的迭代重建
7. Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory [O] . Markus Wittmann, Georg Hager, Gerhard Wellein 2016

机译：用于共享和分布式存储器的模板代码的多核感知并行时间阻塞
8. MiniGhost: A Miniapp for Exploring Boundary Exchange Strategies Using Stencil Computations in Scientific Parallel Computing. [R] . Barett, R. F., Vaughan, C. T., Heroux, M. A. 2012

机译：miniGhost：miniapp用于在科学并行计算中使用模板计算探索边界交换策略。

Efficient multicore-aware parallelization strategies for iterative stencil computations

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅